Slot migration improvement #445

PingXie · 2024-05-07T00:14:41Z

Overview

This PR significantly enhances the reliability and automation of
the Valkey cluster re-sharding process, specifically during slot
migrations in the face of primary failures. These updates address
critical failure issues that previously required extensive manual
intervention and could lead to data loss or inconsistent cluster states.

Enhancements

Automatic Failover Support in Empty Shards

The cluster now supports automatic failover in shards that do not
own any slots, which is common during scaling operations. This
improvement ensures high availability and resilience from the
outset of shard expansion.

Replication of Slot Migration States

All CLUSTER SETSLOT commands are now initially executed on replica
nodes before the primary. This ensures that the slot migration state
is consistent within the shard, preventing state loss in the event of
primary failure. A new timeout parameter has been introduced, allowing
users to specify the duration in milliseconds to wait for replication
to complete, with a default set at 2 seconds.

CLUSTER SETSLOT slot { IMPORTING node-id | MIGRATING node-id | NODE node-id | STABLE } [ TIMEOUT timeout ]

Recovery of Logical Migration Links

The update automatically repairs the logical links between source
and target nodes during failovers. This ensures that requests are
correctly redirected to the new primary in the target shard after
a primary failure, maintaining cluster integrity.

Enhanced Support for New Replicas

New replicas added to shards involved in slot migrations will now
automatically inherit the slot's migration state as part of their
initialization. This ensures that new replicas are immediately
consistent with the rest of the shard.

Improved Logging for Slot Migrations

Additional logging has been implemented to provide operators with
clearer insights into the slot migration processes and automatic
recovery actions, aiding in monitoring and troubleshooting.

Additional Changes

cluster-allow-replica-migration

When cluster-allow-replica-migration is disabled, primary nodes that
lose their last slot to another shard will no longer automatically become
replicas of the receiving shard. Instead, they will remain in their own
shards, which will now be empty, having no slots assigned to them.

Fixes #21.

…ding process, specifically during slot migrations in the face of primary failures. Signed-off-by: Ping Xie <pingxie@google.com>

Signed-off-by: Ping Xie <pingxie@google.com>

codecov · 2024-05-07T00:26:22Z

Codecov Report

Attention: Patch coverage is 82.82443% with 45 lines in your changes are missing coverage. Please review.

Project coverage is 68.89%. Comparing base (93f8a19) to head (fa0285c).

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable     #445      +/-   ##
============================================
+ Coverage     68.43%   68.89%   +0.45%     
============================================
  Files           109      109              
  Lines         61681    61785     +104     
============================================
+ Hits          42214    42565     +351     
+ Misses        19467    19220     -247

Files	Coverage Δ
src/commands.def	`100.00% <ø> (ø)`
src/debug.c	`53.47% <100.00%> (+0.65%)`	⬆️
src/networking.c	`85.08% <100.00%> (ø)`
src/rdb.c	`76.22% <100.00%> (-0.05%)`	⬇️
src/replication.c	`86.01% <100.00%> (-0.29%)`	⬇️
src/server.c	`88.60% <100.00%> (+0.47%)`	⬆️
src/blocked.c	`91.80% <91.66%> (-0.06%)`	⬇️
src/cluster_legacy.c	`83.16% <81.11%> (+8.46%)`	⬆️

... and 12 files with indirect coverage changes

enjoy-binbin · 2024-05-07T10:27:46Z

i see the (squash merge) commit message is missing (Signed-off-by footer is also missing)

srgsanky · 2024-05-08T02:37:13Z

Does any of the CI runs do make test-cluster? We might want to enable cluster tests in at least one run @madolson

I am able to consistently get a test failure after this commit. @PingXie can you please take a look?

./runtest-cluster --single tests/12-replica-migration-2

madolson · 2024-05-08T02:38:45Z

@srgsanky We're running most of the cluster tests as part of #442.

After READONLY, make a cluster replica behave as its primary regarding returning ASK redirects and TRYAGAIN. Without this patch, a client reading from a replica cannot tell if a key doesn't exist or if it has already been migrated to another shard as part of an ongoing slot migration. Therefore, without an ASK redirect in this situation, offloading reads to cluster replicas wasn't reliable. Note: The target of a redirect is always a primary. If a client wants to continue reading from a replica after following a redirect, it needs to figure out the replicas of that new primary using CLUSTER SHARDS or similar. This is related to #21 and has been made possible by the introduction of Replication of Slot Migration States in #445. ---- Release notes: During cluster slot migration, replicas are able to return -ASK redirects and -TRYAGAIN. --------- Signed-off-by: Viktor Söderqvist <viktor.soderqvist@est.tech>

Fixes a regression introduced in PR #445, which allowed a message from a replica to update the slot ownership of its primary. The regression results in a `replicaof` cycle, causing server crashes due to the cycle detection assert. The fix restores the previous behavior where only primary senders can trigger `clusterUpdateSlotsConfigWith`. Additional changes: * Handling of primaries without slots is obsoleted by new handling of when a sender that was a replica announces that it is now a primary. * Replication loop detection code is unchanged but shifted downwards. * Some variables are renamed for better readability and some are introduced to avoid repeated memcmp() calls. Fixes #753. --------- Signed-off-by: Ping Xie <pingxie@google.com>

enhances the reliability and automation of the Valkey cluster re-shar…

091303a

…ding process, specifically during slot migrations in the face of primary failures. Signed-off-by: Ping Xie <pingxie@google.com>

PingXie mentioned this pull request May 7, 2024

Slot migration improvement #245

Closed

Add missing test file

fa0285c

Signed-off-by: Ping Xie <pingxie@google.com>

madolson approved these changes May 7, 2024

View reviewed changes

PingXie merged commit 6e7af94 into valkey-io:unstable May 7, 2024
18 checks passed

PingXie mentioned this pull request May 8, 2024

12-replica-migration-2 fails with UNBLOCKED force unblock from blocking operation, instance state changed (master -> replica?) #463

Closed

zuiderkwast mentioned this pull request May 13, 2024

Make cluster replicas return ASK and TRYAGAIN #495

Merged

This was referenced Jun 2, 2024

Replicate slot migration states via RDB aux fields #586

Merged

[NEW] Server driven slot migration #587

Open

This was referenced Jul 7, 2024

Avoid shard id update of replica if not matching with primary shard id #573

Open

Regression from PR #445 Incorrectly Allows Slot Ownership Updates via Replica #753

Closed

Ensure only primary sender drives slot ownership updates #754

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Slot migration improvement #445

Slot migration improvement #445

PingXie commented May 7, 2024 •

edited by enjoy-binbin

Loading

codecov bot commented May 7, 2024 •

edited

Loading

enjoy-binbin commented May 7, 2024

srgsanky commented May 8, 2024

madolson commented May 8, 2024

Slot migration improvement #445

Slot migration improvement #445

Conversation

PingXie commented May 7, 2024 • edited by enjoy-binbin Loading

Overview

Enhancements

Automatic Failover Support in Empty Shards

Replication of Slot Migration States

Recovery of Logical Migration Links

Enhanced Support for New Replicas

Improved Logging for Slot Migrations

Additional Changes

cluster-allow-replica-migration

codecov bot commented May 7, 2024 • edited Loading

Codecov Report

enjoy-binbin commented May 7, 2024

srgsanky commented May 8, 2024

madolson commented May 8, 2024

PingXie commented May 7, 2024 •

edited by enjoy-binbin

Loading

codecov bot commented May 7, 2024 •

edited

Loading